AD-A191  645 




Exact Tests for Variance Component Models With
Unequal  Cell  Frequencies  in  the  Last  Stage 

by 


A.  I.  Khuri 

Department  of  Statistics 
University  of  Florida 
Gainesville,  FL  32611 


Technical  Report  Number  296 
January  1988 




PREPARED  UNDER  GRANT  NO.  N00014-86-K-0059  FROM  THE  OFFICE  OF 
NAVAL  RESEARCH.  ANDRE  I.  KHURI  AND  RAMON  C.  LITTELL,  PRINCIPAL 
INVESTIGATORS 


DISTRIBUTION STATEMENT A
Approved for public release


REPORT DOCUMENTATION PAGE

Security classification of this page: Unclassified

1.  Report number: 296
4.  Title: Exact Tests for Variance Component Models with Unequal Cell Frequencies in the Last Stage
7.  Author: A. I. Khuri
8.  Contract or grant number: N00014-86-K-0059; R & T 4114552-01; Acct. No. 49101623459
9.  Performing organization name and address: Department of Statistics, 496 Little Hall, University of Florida, Gainesville, FL 32611
11. Controlling office name and address: Office of Naval Research, Mathematical Sciences Division, Arlington, VA
12. Report date: January 1988
13. Number of pages: 21
15. Security class (of this report): Unclassified
16. Distribution statement: Approved for Public Release: Distribution Unlimited
19. Key words: Unbalanced random models; Hypothesis testing; Power of exact tests; Nested and crossed classification models.
20. Abstract: This paper presents a method to construct exact tests for the variance components in a general unbalanced random model. It is assumed that the imbalance affects only the last stage of the design with no missing cells.

EXACT  TESTS  FOR  VARIANCE  COMPONENT  MODELS  WITH  UNEQUAL  CELL 
FREQUENCIES  IN  THE  LAST  STAGE 

Andre  I.  KHURI 

Department  of  Statistics,  University  of  Florida,  Gainesville,  FL  32611,  USA 

Abstract:  Khuri  and  Littell  (1987)  derived  exact  tests  for  testing  hypotheses  concerning  the 
variance  components  in  an  unbalanced  random  two-way  model.  The  method  used  in  the 
development  of  these  tests  can  be  extended  to  more  general  unbalanced  random  models.  In  this 
article,  such  an  extension  is  established  for  models  that  are  unbalanced  only  with  respect  to  the 
last  stage  of  their  associated  designs.  A  numerical  example  is  given  to  illustrate  the 
implementation  of  the  proposed  methodology. 

AMS  Subject  Classification:  Primary  62J10;  Secondary  62F03. 

Key  words  and  phrases:  Unbalanced  random  models;  Hypothesis  testing;  Power  of  exact  tests; 
Nested  and  crossed  classification  models. 


1.  Introduction 

The traditional analysis of data from an unbalanced random model uses approximate F tests that are based on sums of squares, which, in general, are neither independent nor distributed as scaled chi-square variates. The true critical values and power functions of these tests are unknown and depend on variance components other than those under consideration. Furthermore, the tests, particularly those that depend on Satterthwaite's procedure, can be quite unreliable. This was demonstrated by Tietjen (1974), in the case of the unbalanced random two-fold nested model, and by Cummings and Gaylor (1974).

The method developed by Khuri and Littell (1987) produced exact tests for the variance components in an unbalanced random two-way classification with interaction model. This method is based on a particular transformation that reduces the analysis of the unbalanced model to that of a balanced one. This article demonstrates that the same kind of transformation applies to any unbalanced random model, provided that the imbalance occurs only in the last stage of the associated design.


2.  Notation  and  preliminaries 

The model for a general unbalanced design whose imbalance is caused by unequal cell frequencies in the last stage can be written in the form

$$y_\theta = \sum_{i=0}^{\nu} \gamma_{\theta_i(\bar\theta_i)} + \epsilon_\theta, \tag{2.1}$$

where $\theta = (k_1, k_2, \ldots, k_s)$ is a complete set of subscripts that identify a typical response $y$. The term $\gamma_{\theta_i(\bar\theta_i)}$ denotes the $i$th effect in the model, where $\bar\theta_i$ and $\theta_i$ are, respectively, the corresponding sets of rightmost and nonrightmost bracket subscripts (see Section 2 in Khuri (1982)). By definition, $\theta_i = \bar\theta_i = \phi$, the empty set, for $i = 0$, and the corresponding $\gamma$ is the grand mean, usually denoted by $\mu$. It is assumed that $\gamma_{\theta_i(\bar\theta_i)}$, for $i = 1, 2, \ldots, \nu$, and $\epsilon_\theta$ are independent and normally distributed random variables with zero means and variances $\sigma_1^2, \sigma_2^2, \ldots, \sigma_\nu^2$, and $\sigma_\epsilon^2$, respectively.

Since the design is balanced except for its last stage, the ranges of subscripts $k_1, k_2, \ldots, k_s$ can be expressed in the form

$$k_j = \begin{cases} 1, 2, \ldots, a_j, & j = 1, 2, \ldots, s-1, \\ 1, 2, \ldots, n_\tau, & j = s, \end{cases} \tag{2.2}$$

where $\tau$ is a subset of $\theta$ consisting of the first $s-1$ subscripts, that is,

$$\tau = (k_1, k_2, \ldots, k_{s-1}). \tag{2.3}$$

Except for $\epsilon_\theta$, all the effects on the right side of (2.1) are indexed by the subscripts in $\tau$. Let $T$ be the set of all $(s-1)$-tuples as in (2.3), and let $c$ denote the number of elements in $T$, that is,

$$T = \{\tau = (k_1, k_2, \ldots, k_{s-1}) : k_j = 1, 2, \ldots, a_j;\ j = 1, 2, \ldots, s-1\}, \tag{2.4}$$

$$c = \prod_{j=1}^{s-1} a_j. \tag{2.5}$$

It is assumed that $n_\tau \geq 1$ for all $\tau \in T$ and, for reasons to be seen later, that

$$N > 2c - 1, \tag{2.6}$$

where $N = \sum_{\tau \in T} n_\tau$ is the total number of observations. Let $\psi_i = (\theta_i : \bar\theta_i)$ be the set of subscripts associated with the $i$th effect, which results from combining the elements of $\theta_i$ and $\bar\theta_i$ ($i = 0, 1, \ldots, \nu$). The complement of $\psi_i$ with respect to $\tau$ is denoted by $\psi_i^c$ ($i = 0, 1, \ldots, \nu$).


We note that if it were not for $\epsilon_\theta$, the model in (2.1) would be of the same form as that of a balanced model (see Khuri (1982) for a general representation of a balanced model). This fact will be quite useful in the development of the exact tests in Section 3.


The model in (2.1) can be written in matrix form as

$$y = \sum_{i=0}^{\nu} X_i \beta_i + \epsilon, \tag{2.7}$$

where $y$ and $\epsilon$ are the vectors of observations and random errors, respectively, $X_i$ is a matrix of zeros and ones of order $N \times c_i$ ($i = 0, 1, \ldots, \nu$) with $X_0$ being equal to $1_N$, the vector of ones of order $N \times 1$, and $\beta_i$ is a vector consisting of the $c_i$ elements of $\gamma_{\theta_i(\bar\theta_i)}$. The integer $c_i$ is given by

$$c_i = \prod_{k_j \in \psi_i} a_j, \quad i = 0, 1, \ldots, \nu, \tag{2.8}$$

where, if we recall, $\psi_i$ is the set of subscripts associated with the $i$th effect. Note that $c_i = 1$ for $i = 0$.


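Let $\bar y_\tau$ denote the mean of $y_\theta$ averaged over the range of subscript $k_s$ for a given $\tau = (k_1, k_2, \ldots, k_{s-1})$ in $T$, that is,

$$\bar y_\tau = \frac{1}{n_\tau} \sum_{k_s=1}^{n_\tau} y_\theta, \quad \tau \in T. \tag{2.9}$$

From (2.1) we have

$$\bar y_\tau = \sum_{i=0}^{\nu} \gamma_{\theta_i(\bar\theta_i)} + \bar\epsilon_\tau, \tag{2.10}$$

where $\bar\epsilon_\tau = \left( \sum_{k_s=1}^{n_\tau} \epsilon_\theta \right) / n_\tau$. Formula (2.10) may be written in matrix form as

$$\bar y = \sum_{i=0}^{\nu} H_i \beta_i + \bar\epsilon, \tag{2.11}$$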

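where $H_i$ is a matrix of order $c \times c_i$. Since $\sum_{i=0}^{\nu} \gamma_{\theta_i(\bar\theta_i)}$ in (2.10) is of the same form as in a balanced model, the matrix $H_i$ can be expressed as a direct product of the form (see Khuri (1982), p. 2908)

$$H_i = \bigotimes_{\ell=1}^{s-1} L_{i\ell}, \quad i = 0, 1, \ldots, \nu, \tag{2.12}$$

where $L_{i\ell}$ is given by

$$L_{i\ell} = \begin{cases} 1_{a_\ell}, & k_\ell \in \psi_i^c, \\ I_{a_\ell}, & k_\ell \in \psi_i, \end{cases} \quad i = 0, 1, \ldots, \nu;\ \ell = 1, 2, \ldots, s-1. \tag{2.13}$$

In (2.13), $I_{a_\ell}$ and $1_{a_\ell}$ denote, respectively, the identity matrix of order $a_\ell \times a_\ell$ and the vector of ones of order $a_\ell \times 1$. We recall that $\psi_i^c$ in (2.13) is the complement of $\psi_i$ with respect to $\tau$. Let $A_i = H_i H_i'$, then

$$A_i = \bigotimes_{\ell=1}^{s-1} L_{i\ell} L_{i\ell}', \quad i = 0, 1, \ldots, \nu, \tag{2.14}$$

where

$$L_{i\ell} L_{i\ell}' = \begin{cases} I_{a_\ell}, & k_\ell \in \psi_i, \\ J_{a_\ell}, & k_\ell \in \psi_i^c, \end{cases} \quad i = 0, 1, \ldots, \nu;\ \ell = 1, 2, \ldots, s-1. \tag{2.15}$$

In (2.15), $J_{a_\ell}$ denotes the matrix of ones of order $a_\ell \times a_\ell$.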
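The direct-product structure in (2.12) through (2.15) can be checked numerically. The following is a minimal sketch, assuming NumPy and the layout of the numerical example in Section 5 (three plants, four sites, three workers); the helper names `kron_all`, `h_matrix`, and `a_matrix` are illustrative and do not come from the report.

```python
import numpy as np
from functools import reduce

def kron_all(mats):
    """Direct (Kronecker) product of a list of matrices, left to right."""
    return reduce(np.kron, mats)

def h_matrix(levels, psi):
    """H_i as in (2.12)-(2.13): I_a for subscripts in psi, 1_a otherwise."""
    return kron_all([np.eye(a) if l in psi else np.ones((a, 1))
                     for l, a in enumerate(levels)])

def a_matrix(levels, psi):
    """A_i as in (2.14)-(2.15): I_a for subscripts in psi, J_a otherwise."""
    return kron_all([np.eye(a) if l in psi else np.ones((a, a))
                     for l, a in enumerate(levels)])

levels = (3, 4, 3)                                   # a_1, a_2, a_3 (assumed)
# psi_i for: grand mean, plant, site(plant), worker(plant), interaction
effects = [set(), {0}, {0, 1}, {0, 2}, {0, 1, 2}]

H = [h_matrix(levels, psi) for psi in effects]
A = [a_matrix(levels, psi) for psi in effects]
shapes = [h.shape for h in H]                        # (c, c_i) for each effect
```

Each `A[i]` equals `H[i] @ H[i].T`, and any two of the `A` matrices commute, which is the property used to obtain the simultaneous diagonalization in (2.18).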

The variance-covariance matrix of $\bar y$ in (2.11) can now be written as

$$\mathrm{Var}(\bar y) = \sum_{i=1}^{\nu} \sigma_i^2 A_i + \sigma_\epsilon^2 K, \tag{2.16}$$

where

$$K = \mathrm{diag}\left( \frac{1}{n_\tau} \right)_{\tau \in T}. \tag{2.17}$$

The right side of (2.17) denotes a diagonal matrix of order $c \times c$. Its diagonal elements are the reciprocals of the $n_\tau$'s for $\tau \in T$, the set of all values of $\tau$ in (2.3). It can be verified that $A_i A_j = A_j A_i$ for $i \neq j$. It follows that there exists an orthogonal matrix, $Q$, of order $c \times c$ such that

$$Q A_i Q' = \Lambda_i, \quad i = 0, 1, \ldots, \nu, \tag{2.18}$$

where $\Lambda_i$ is a diagonal matrix. The construction of the matrix $Q$ will be described in Section 3.


3.  The  development  of  the  exact  tests 


Let us again consider model (2.10). As was noted earlier in Section 2, the first part on the right side of this model, namely, $\sum_{i=0}^{\nu} \gamma_{\theta_i(\bar\theta_i)}$, is not affected by the imbalance in the last stage of the design, that is, in the $\tau$-cells, where $\tau$ is described in (2.3). Let us therefore consider the derived model

$$y_\tau^{*} = \sum_{i=0}^{\nu} \gamma_{\theta_i(\bar\theta_i)}. \tag{3.1}$$

Model (3.1) is balanced with one observation in each $\tau$-cell. Let $P_i$ be a $c \times c$ matrix associated with the sum of squares for the $i$th effect ($i = 0, 1, \ldots, \nu$). From Khuri (1982) we have the following lemmas:

Lemma 3.1.

(i) $P_i$ is idempotent ($i = 0, 1, \ldots, \nu$).

(ii) $P_i P_{i'} = 0$ for $i \neq i'$.

(iii) $\sum_{i=0}^{\nu} P_i = I_c$.

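Lemma 3.2. The matrix $P_i$ can be expressed in terms of the $A_j$'s in (2.14) as

$$P_i = \sum_{j=0}^{\nu} \frac{\lambda_{ij}}{b_j} A_j, \quad i = 0, 1, \ldots, \nu, \tag{3.2}$$

where $\lambda_{ij}$ is a known constant equal to the coefficient of the $j$th admissible mean in the $i$th component for the balanced model in (3.1) (possible values of $\lambda_{ij}$ are $-1$, $0$, and $1$), and $b_j$ is given by

$$b_j = \begin{cases} \prod_{k_\ell \in \psi_j^c} a_\ell, & \psi_j^c \neq \phi, \\ 1, & \psi_j^c = \phi, \end{cases} \quad j = 0, 1, \ldots, \nu, \tag{3.3}$$

where $\psi_j^c$ is the complement of $\psi_j$ with respect to $\tau$.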


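Lemma 3.3. The $A_j$ and $P_i$ matrices in (3.2) are related by the following identity:

$$A_j P_i = \kappa_{ij} P_i, \quad i = 0, 1, \ldots, \nu;\ j = 0, 1, \ldots, \nu, \tag{3.4}$$

where $\kappa_{ij}$ is given by

$$\kappa_{ij} = \begin{cases} 0, & \psi_i \not\subset \psi_j, \\ b_j, & \psi_i \subset \psi_j, \end{cases} \tag{3.5}$$

and $\psi_i$ is the set of subscripts associated with the $i$th effect in (3.1), $i = 0, 1, \ldots, \nu$.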

Let $m_i$ be the rank of $P_i$ ($i = 0, 1, \ldots, \nu$), then by Lemma 3.1, $\sum_{i=0}^{\nu} m_i = c$. Let $Q_i$ be a matrix of order $m_i \times c$ and rank $m_i$ whose rows are orthonormal and span the row space of $P_i$ ($i = 0, 1, \ldots, \nu$). The matrix $Q_i$ can be easily obtained as the result of a Gram-Schmidt orthonormal factorization of the rows (or columns) of $P_i$. The proof of the following lemma is given in Appendix A:

Lemma 3.4. The matrices $Q_0, Q_1, \ldots, Q_\nu$ have the following properties:

(i) $Q_0 = 1_c' / \sqrt{c}$.

(ii) $Q_i Q_i' = I_{m_i}$, $i = 0, 1, \ldots, \nu$, and $Q_i Q_{i'}' = 0$ for $i \neq i'$.

(iii) For $i = 0, 1, \ldots, \nu$ and $j = 0, 1, \ldots, \nu$,

$$A_j Q_i' = \begin{cases} 0, & \psi_i \not\subset \psi_j, \\ b_j Q_i', & \psi_i \subset \psi_j. \end{cases}$$
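The properties in Lemma 3.4 can be illustrated on the design of Section 5. The sketch below is hedged: it assumes NumPy, takes the $P_i$ matrices in the form listed in Table 2 for that example, and obtains each $Q_i$ from the eigenvectors of $P_i$ rather than from the Gram-Schmidt factorization mentioned in the text (both give orthonormal rows spanning the row space of $P_i$). The names `row_basis`, `B1`, `B2`, `B3` are illustrative.

```python
import numpy as np
from functools import reduce

def kron_all(*mats):
    return reduce(np.kron, mats)

def J(a):
    """Matrix of ones of order a x a."""
    return np.ones((a, a))

I = np.eye
B1 = kron_all(I(3), J(12)) / 12          # averaging within plants
B2 = kron_all(I(12), J(3)) / 3           # averaging within plant-site cells
B3 = kron_all(I(3), J(4), I(3)) / 4      # averaging within plant-worker cells
P = [J(36) / 36, B1 - J(36) / 36, B2 - B1, B3 - B1, I(36) - B2 - B3 + B1]

def row_basis(Pi):
    """Orthonormal rows spanning the row space of a symmetric idempotent matrix."""
    vals, vecs = np.linalg.eigh(Pi)
    return vecs[:, vals > 0.5].T

Q = [row_basis(Pi) for Pi in P]
ranks = [q.shape[0] for q in Q]          # m_0, m_1, ..., m_4
Q_full = np.vstack(Q)                    # c x c matrix stacking Q_0, ..., Q_4

A1 = kron_all(I(3), J(12))               # A_1 for the plant effect, b_1 = 12
diag_A1 = Q_full @ A1 @ Q_full.T         # diagonal by Lemma 3.4 (iii)
```

Here `Q_full` is orthogonal, and `diag_A1` carries the value 12 in the positions belonging to $Q_0$ and $Q_1$ and zeros elsewhere, in line with (2.18).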


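Let us now define the matrix

$$Q = [Q_0' : Q_1' : \cdots : Q_\nu']',$$

which is of order $c \times c$. By Lemma 3.4, $Q$ is an orthogonal matrix and diagonalizes $A_0, A_1, \ldots, A_\nu$ simultaneously as in (2.18). From (2.16) we have

$$\mathrm{Var}(Q_i \bar y) = Q_i \left( \sum_{j=1}^{\nu} \sigma_j^2 A_j + \sigma_\epsilon^2 K \right) Q_i', \quad i = 1, 2, \ldots, \nu, \tag{3.6}$$

$$= \sum_{j=1}^{\nu} \sigma_j^2 (Q_i A_j Q_i') + \sigma_\epsilon^2 (Q_i K Q_i'), \quad i = 1, 2, \ldots, \nu, \tag{3.7}$$

where $\bar y$ is the vector of all $\tau$-cell means defined in (2.11).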


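But, by Lemma 3.4 ((ii), (iii)),

$$\sum_{j=1}^{\nu} \sigma_j^2 (Q_i A_j Q_i') = \delta_i I_{m_i}, \quad i = 1, 2, \ldots, \nu, \tag{3.8}$$

where

$$\delta_i = \sum_{j \in W_i} b_j \sigma_j^2. \tag{3.9}$$

In (3.9), $W_i$ is the set

$$W_i = \{ j : \psi_i \subset \psi_j,\ 1 \leq j \leq \nu \}.$$

From (3.7) and (3.8) we obtain

$$\mathrm{Var}(Q_i \bar y) = \delta_i I_{m_i} + \sigma_\epsilon^2 (Q_i K Q_i'), \quad i = 1, 2, \ldots, \nu. \tag{3.10}$$

Furthermore, for $i \neq i'$,

$$\mathrm{Cov}(Q_i \bar y,\ \bar y' Q_{i'}') = Q_i \left( \sum_{j=1}^{\nu} \sigma_j^2 A_j + \sigma_\epsilon^2 K \right) Q_{i'}' = \sigma_\epsilon^2 (Q_i K Q_{i'}'). \tag{3.11}$$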


Let $u$ be a vector of order $(c-1) \times 1$ defined as

$$u = [Q_1' : Q_2' : \cdots : Q_\nu']' \bar y. \tag{3.12}$$

By (3.10) and (3.11), the variance-covariance matrix of $u$ is of the form

$$\mathrm{Var}(u) = \mathrm{diag}(\delta_1 I_{m_1}, \delta_2 I_{m_2}, \ldots, \delta_\nu I_{m_\nu}) + \sigma_\epsilon^2 G, \tag{3.13}$$

where

$$G = \tilde Q K \tilde Q', \tag{3.14}$$

and $\tilde Q = [Q_1' : Q_2' : \cdots : Q_\nu']'$.
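Now, the vector $\bar y$ can be expressed as $\bar y = D y$, where $y$ is the vector of observations in (2.7) and $D$ is the direct sum

$$D = \bigoplus_{\tau \in T} \left( 1_{n_\tau}' / n_\tau \right), \tag{3.15}$$

where $\tau$ is defined in (2.3) and $T$ is the set of all values of $\tau$. The matrix $D$ is of order $c \times N$, where $N$ is the total number of observations. Also, the residual sum of squares for the original unbalanced model in (2.1) can be written as

$$SS_E = \sum_{\theta} (y_\theta - \bar y_\tau)^2 = \sum_{\tau \in T} \sum_{k_s=1}^{n_\tau} (y_\theta - \bar y_\tau)^2, \tag{3.16}$$

where, if we recall, $\theta = (\tau, k_s)$. Formula (3.16) can be rewritten as

$$SS_E = y' R y, \tag{3.17}$$

where

$$R = \bigoplus_{\tau \in T} \left( I_{n_\tau} - J_{n_\tau} / n_\tau \right). \tag{3.18}$$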

Lemma 3.5.

(i) $R$ is idempotent of rank $N - c$.

(ii) $DR = 0$.

(iii) $R X_i = 0$, $i = 1, 2, \ldots, \nu$,

where $X_1, X_2, \ldots, X_\nu$ are the matrices given in (2.7).

Proof. See Appendix B.
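Parts (i) and (ii) of Lemma 3.5 are easy to confirm numerically. The following is a small sketch, assuming NumPy and a hypothetical set of four cell frequencies; `direct_sum` is an illustrative helper, not something defined in the report.

```python
import numpy as np

def direct_sum(blocks):
    """Block-diagonal matrix built from a list of 2-D arrays."""
    rows = sum(b.shape[0] for b in blocks)
    cols = sum(b.shape[1] for b in blocks)
    out = np.zeros((rows, cols))
    r = c = 0
    for b in blocks:
        out[r:r + b.shape[0], c:c + b.shape[1]] = b
        r += b.shape[0]
        c += b.shape[1]
    return out

n = [2, 1, 4, 3]                       # hypothetical cell frequencies n_tau
N, c = sum(n), len(n)

D = direct_sum([np.ones((1, k)) / k for k in n])                  # (3.15)
R = direct_sum([np.eye(k) - np.ones((k, k)) / k for k in n])      # (3.18)

rng = np.random.default_rng(1)
y = rng.normal(size=N)                 # arbitrary response vector
cell_means = D @ y                     # the vector of tau-cell means
sse_quadratic = y @ R @ y              # SS_E as in (3.17)
sse_direct = sum(((y[s:s + k] - y[s:s + k].mean()) ** 2).sum()
                 for s, k in zip(np.cumsum([0] + n[:-1]), n))     # SS_E as in (3.16)
```

The two expressions for the residual sum of squares agree, `R` is idempotent with trace $N - c$, and `D @ R` is a zero matrix.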

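From Lemma 3.5 we conclude that

$$D \Sigma R = 0, \tag{3.19}$$

where $\Sigma$ is the variance-covariance matrix of $y$ in (2.7), which is equal to

$$\Sigma = \sum_{i=1}^{\nu} \sigma_i^2 X_i X_i' + \sigma_\epsilon^2 I_N. \tag{3.20}$$

Furthermore,

$$R \Sigma / \sigma_\epsilon^2 = R. \tag{3.21}$$

It follows from (3.19) and (3.21) that $\bar y = D y$ is independent of $SS_E$ and that $SS_E / \sigma_\epsilon^2$ has the chi-square distribution with $N - c$ degrees of freedom, the rank of $R$.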

Since $R$ is idempotent of rank $N - c$, it can be written as

$$R = C \Lambda C', \tag{3.22}$$

where $C$ is an orthogonal matrix and $\Lambda$ is a diagonal matrix with $N - c$ ones and $c$ zeros. By the assumption in (2.6), $\Lambda$ can be partitioned as

$$\Lambda = \mathrm{diag}(I_{\ell_1}, I_{\ell_2}, 0), \tag{3.23}$$

where

$$\ell_1 = c - 1, \qquad \ell_2 = N - 2c + 1 > 0, \tag{3.24}$$

and $0$ in (3.23) is a zero matrix of order $c \times c$. Accordingly, the matrix $C$ in (3.22) can be partitioned as $C = [C_1 : C_2 : C_3]$, where $C_1$, $C_2$, and $C_3$ are of orders $N \times \ell_1$, $N \times \ell_2$, and $N \times c$, respectively. We then have

$$R = C_1 C_1' + C_2 C_2'. \tag{3.25}$$

Note that

$$C_i' C_i = I, \quad i = 1, 2, 3; \qquad C_i' C_{i'} = 0, \quad i \neq i'. \tag{3.26}$$


Let us now define the random vector $w$ as

$$w = u + (\lambda_{\max} I_{\ell_1} - G)^{1/2} C_1' y, \tag{3.27}$$

where $G$ is the matrix in (3.14) and $\lambda_{\max}$ is its largest eigenvalue. The matrix $(\lambda_{\max} I_{\ell_1} - G)^{1/2}$ is symmetric with eigenvalues equal to the square roots of the eigenvalues of $\lambda_{\max} I_{\ell_1} - G$, which are nonnegative. Let $w$ be partitioned just like $u$ in (3.12) as

$$w = (w_1', w_2', \ldots, w_\nu')', \tag{3.28}$$

where $w_i$ is of order $m_i \times 1$ and $m_i$ is the number of rows of $Q_i$ ($i = 1, 2, \ldots, \nu$) in (3.12). The distributional properties of the $w_i$'s are given in the following lemma:

Lemma 3.6.

(i) $E(w_i) = 0$, $i = 1, 2, \ldots, \nu$.

(ii) $w_1, w_2, \ldots, w_\nu$ are independently distributed as normal random vectors with $w_i$ having the variance-covariance matrix

$$\mathrm{Var}(w_i) = (\delta_i + \lambda_{\max} \sigma_\epsilon^2) I_{m_i}, \quad i = 1, 2, \ldots, \nu, \tag{3.29}$$

where $\delta_i$ is defined in (3.9).

(iii) $w_1, w_2, \ldots, w_\nu$ are independent of $SS_E^{(2)}$, where $SS_E^{(2)} = y' C_2 C_2' y$ is the portion of the residual sum of squares, $SS_E$, in (3.17), which corresponds to the matrix $C_2$ in (3.25).


Proof.

(i) From (2.11) and Lemma 3.4, $E(Q_i \bar y) = Q_i H_0 \beta_0 = Q_i 1_c \beta_0 = 0$ for $i = 1, 2, \ldots, \nu$. Hence, $E(u) = 0$ by (3.12). From (3.18) we also have that $R 1_N = 0$, which by (3.25) can be rewritten as

$$(C_1 C_1' + C_2 C_2') 1_N = 0. \tag{3.30}$$

Using (3.26) in (3.30) we get $C_1' 1_N = 0$. It follows from (2.7) that $E(C_1' y) = C_1' 1_N \beta_0 = 0$. The mean of $w$ in (3.27) is therefore equal to zero.

(ii) It is clear that $w$ is normally distributed. Now, the vector $u$ in (3.27) is independent of $C_1' y$. To show this, we note from (3.19), (3.25), and (3.26) that

$$D \Sigma C_1 = 0. \tag{3.31}$$

Hence,

$$\mathrm{Cov}(\bar y,\ y' C_1) = D \Sigma C_1 = 0,$$

since $\bar y = D y$. It follows that $\bar y$, and hence $u$ in (3.12), is independent of $C_1' y$. The variance-covariance matrix of $w$ can therefore be expressed as

$$\mathrm{Var}(w) = \mathrm{Var}(u) + (\lambda_{\max} I_{\ell_1} - G)^{1/2} C_1' \Sigma C_1 (\lambda_{\max} I_{\ell_1} - G)^{1/2}. \tag{3.32}$$

But, from (3.21) we have

$$R \Sigma R = \sigma_\epsilon^2 R. \tag{3.33}$$

Also, from (3.25), (3.26), and (3.33) it can be verified that

$$C_1' \Sigma C_1 = \sigma_\epsilon^2 I_{\ell_1}. \tag{3.34}$$

From (3.32) and (3.34) we then have

$$\mathrm{Var}(w) = \mathrm{Var}(u) + \sigma_\epsilon^2 (\lambda_{\max} I_{\ell_1} - G). \tag{3.35}$$

By using (3.13) in (3.35) we get

$$\mathrm{Var}(w) = \mathrm{diag}(\delta_1 I_{m_1}, \delta_2 I_{m_2}, \ldots, \delta_\nu I_{m_\nu}) + \lambda_{\max} \sigma_\epsilon^2 I_{\ell_1}. \tag{3.36}$$

From (3.36) we conclude that $w_1, w_2, \ldots, w_\nu$ are independent and that $\mathrm{Var}(w_i)$ has the form described in (3.29).

(iii) $SS_E^{(2)}$ is independent of $u$ (since $SS_E^{(2)}$ is independent of $\bar y$ by (3.19)) and is also independent of $C_1' y$. This is true because

$$C_1' \Sigma C_2 = 0, \tag{3.37}$$

which follows from (3.25), (3.26), and (3.33). Consequently, $SS_E^{(2)}$ and $w$ in (3.27) are independent.

From Lemma 3.6 we conclude that if $SS_i = w_i' w_i$ ($i = 1, 2, \ldots, \nu$), then the sums of squares, $SS_1, SS_2, \ldots, SS_\nu$, are independent and $SS_i / (\delta_i + \lambda_{\max} \sigma_\epsilon^2)$ is distributed as a central chi-square variate with $m_i$ degrees of freedom ($i = 1, 2, \ldots, \nu$). Furthermore, the $SS_i$'s are independent of $SS_E^{(2)} / \sigma_\epsilon^2$, which has the chi-square distribution with $\ell_2$ degrees of freedom, where $\ell_2$ is defined in (3.24).

It follows that $SS_1, SS_2, \ldots, SS_\nu$, and $SS_E^{(2)}$ act like sums of squares in an ANOVA table for a balanced random model. In other words, the analysis concerning the variance components, $\sigma_1^2, \sigma_2^2, \ldots, \sigma_\nu^2$, can proceed using these sums of squares just as in a balanced data situation.
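Because $w$ is a linear function of $y$, the covariance structure in (3.36) can be verified exactly, without simulation, by writing $w = My$ and computing $M \Sigma M'$. The sketch below does this for a hypothetical two-way crossed random model with interaction (the case treated by Khuri and Littell (1987)); NumPy, the cell frequencies, and the variance-component values are all assumptions chosen for illustration, and eigenvectors are used in place of a Gram-Schmidt factorization.

```python
import numpy as np

def ones_mat(k):
    return np.ones((k, k))

def row_basis(P):
    """Orthonormal rows spanning the row space of a symmetric idempotent P."""
    vals, vecs = np.linalg.eigh(P)
    return vecs[:, vals > 0.5].T

a1, a2 = 3, 4                                          # levels of factors A and B
c = a1 * a2
n = np.array([1, 3, 2, 4, 2, 2, 5, 1, 3, 2, 4, 2])     # hypothetical n_tau, N = 31 > 2c - 1
N = int(n.sum())

E = np.zeros((N, c))
E[np.arange(N), np.repeat(np.arange(c), n)] = 1.0      # cell incidence matrix
D = (E / n).T                                          # (3.15)
R = np.eye(N) - E @ D                                  # (3.18)
K = np.diag(1.0 / n)                                   # (2.17)

P_A = np.kron(np.eye(a1), ones_mat(a2)) / a2 - ones_mat(c) / c
P_B = np.kron(ones_mat(a1), np.eye(a2)) / a1 - ones_mat(c) / c
P_AB = np.eye(c) - P_A - P_B - ones_mat(c) / c
Q = [row_basis(P) for P in (P_A, P_B, P_AB)]
m = [q.shape[0] for q in Q]
Qt = np.vstack(Q)                                      # stacks Q_1, ..., Q_nu

G = Qt @ K @ Qt.T                                      # (3.14)
lam_max = np.linalg.eigvalsh(G).max()
vals, vecs = np.linalg.eigh(lam_max * np.eye(c - 1) - G)
S = vecs @ np.diag(np.sqrt(np.clip(vals, 0.0, None))) @ vecs.T

r_vals, r_vecs = np.linalg.eigh(R)
C = r_vecs[:, r_vals > 0.5]                            # eigenvectors of R with eigenvalue 1
C1, C2 = C[:, :c - 1], C[:, c - 1:]

M = Qt @ D + S @ C1.T                                  # w = M @ y, as in (3.27)

s2_A, s2_B, s2_AB, s2_e = 2.0, 0.5, 1.5, 0.7           # hypothetical variance components
A_A = np.kron(np.eye(a1), ones_mat(a2))
A_B = np.kron(ones_mat(a1), np.eye(a2))
Sigma = E @ (s2_A * A_A + s2_B * A_B + s2_AB * np.eye(c)) @ E.T + s2_e * np.eye(N)

delta = [a2 * s2_A + s2_AB, a1 * s2_B + s2_AB, s2_AB]  # (3.9)
expected = np.diag(np.repeat(delta, m)) + lam_max * s2_e * np.eye(c - 1)
var_ok = np.allclose(M @ Sigma @ M.T, expected)        # checks (3.36)
indep_ok = np.allclose(M @ Sigma @ C2, 0.0)            # w uncorrelated with C_2'y
```

With `w = M @ y` for an observed `y`, the sums of squares are the squared norms of the consecutive blocks of `w` of lengths `m`, and $SS_E^{(2)}$ is the squared norm of `C2.T @ y`.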

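In particular, if the data set is balanced, that is, if $n_\tau = n$, then $K = I_c / n$ (see (2.17)) and $G = I_{\ell_1} / n$, where $G$ and $\ell_1$ are defined in (3.14) and (3.24), respectively. Consequently, $\lambda_{\max}$, the largest eigenvalue of $G$, is equal to $1/n$. The vectors $w$ and $u$ in (3.27) are therefore identical. In this case,

$$w'w = u'u = \bar y' \left( \sum_{i=1}^{\nu} Q_i' Q_i \right) \bar y \quad \text{(by (3.12))}$$

$$= \bar y' (I_c - J_c / c) \bar y \quad \text{(by Lemma 3.4)}$$

$$= \bar y' \left( \sum_{i=1}^{\nu} P_i \right) \bar y \quad \text{(by Lemma 3.1(iii) and the fact that } P_0 = J_c / c \text{ by formula (3.2))}.$$

Thus,

$$\sum_{i=1}^{\nu} SS_i = \sum_{i=1}^{\nu} \bar y' P_i \bar y.$$

Now, from (3.2) we can write

$$\bar y' P_i \bar y = \bar y' \left[ \sum_{j=0}^{\nu} \frac{\lambda_{ij}}{b_j} A_j \right] \bar y = \bar y' \left[ \sum_{j=0}^{\nu} \frac{\lambda_{ij}}{b_j} \bigotimes_{\ell=1}^{s-1} L_{j\ell} L_{j\ell}' \right] \bar y \quad \text{(by (2.14))}. \tag{3.38}$$

Since $\bar y = (I_c \otimes 1_n') y / n$, formula (3.38) can be expressed as

$$\bar y' P_i \bar y = \frac{1}{n^2}\, y' \left[ \sum_{j=0}^{\nu} \frac{\lambda_{ij}}{b_j} (A_j \otimes J_n) \right] y. \tag{3.39}$$

Formula (3.39) shows that $n \bar y' P_i \bar y$ is the usual sum of squares for the $i$th effect in a balanced model of the form given in (2.1). In other words, $n SS_1, n SS_2, \ldots$, and $n SS_\nu$ reduce to the usual balanced ANOVA sums of squares associated with the corresponding $\nu$ effects whenever the data set is balanced.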


4.  Power  of  the  exact  tests 


Power values for the exact tests in Section 3 can be easily obtained just as in a balanced model situation. As in Khuri and Littell (1987), it is easy to show that such power values are monotone decreasing with respect to $\lambda_{\max}$, the largest eigenvalue of the matrix $G$ in (3.14). Upper and lower bounds on $\lambda_{\max}$ are given by the double inequality

$$\frac{1}{c} \sum_{\tau \in T} \frac{1}{n_\tau} \leq \lambda_{\max} \leq \frac{1}{\min_{\tau}(n_\tau)}, \tag{4.1}$$

where $n_\tau$ is the frequency of the $\tau$-cell (see (2.3)), and $T$ is the set of all values of $\tau$. The proof of (4.1) is similar to the one given in Lemma 2 in Khuri and Littell (1987) and will, therefore, be omitted. We note that the lower bound in (4.1) is the reciprocal of the harmonic mean of the $\tau$-cell frequencies.
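The double inequality (4.1) can be illustrated numerically. The sketch below assumes NumPy and a hypothetical set of cell frequencies. It relies on the fact that the rows of $[Q_1' : \cdots : Q_\nu']'$ form an orthonormal basis of the orthogonal complement of $1_c$, so the eigenvalues of $G$ can be computed from any such basis; the function name `lambda_max` is illustrative.

```python
import numpy as np

def lambda_max(n):
    """Largest eigenvalue of G = Q K Q' for cell frequencies n (all >= 1)."""
    n = np.asarray(n, dtype=float)
    c = n.size
    vals, vecs = np.linalg.eigh(np.eye(c) - np.ones((c, c)) / c)
    Qt = vecs[:, vals > 0.5].T            # (c-1) x c, rows orthogonal to 1_c
    G = Qt @ np.diag(1.0 / n) @ Qt.T
    return float(np.linalg.eigvalsh(G).max())

n = [1, 2, 2, 3, 5, 4]                    # hypothetical cell frequencies
lam = lambda_max(n)
lower = float(np.mean(1.0 / np.asarray(n, dtype=float)))   # reciprocal of harmonic mean
upper = 1.0 / min(n)
lam_balanced = lambda_max([3] * 6)        # balanced case, n_tau = 3
```

In the balanced case the two bounds coincide and `lam_balanced` equals $1/3$, which agrees with $\lambda_{\max} = 1/n$ noted at the end of Section 3.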


5.  A  numerical  example 

An  example  is  given  in  Milliken  and  Johnson  (1984,  p.  264)  of  a  study  concerning  the  efficiency 
of  workers  in  assembly  lines  at  several  plants.  Three  plants  were  randomly  selected.  Four  assembly 
sites  and  three  workers  were  randomly  selected  in  each  plant.  For  convenience,  the  efficiency  scores  are 
reproduced  in  Table  1. 

The model for this experiment is

$$y_{ijk\ell} = \mu + \alpha_i + \beta_{i(j)} + \gamma_{i(k)} + (\beta\gamma)_{i(jk)} + \epsilon_{ijk\ell},$$

where $\alpha_i$ is the effect of the $i$th plant ($i = 1, 2, 3$), $\beta_{i(j)}$ is the effect of the $j$th site within the $i$th plant ($j = 1, 2, 3, 4$), $\gamma_{i(k)}$ is the effect of the $k$th worker within the $i$th plant ($k = 1, 2, 3$), $(\beta\gamma)_{i(jk)}$ is the interaction effect of sites and workers within plant $i$, and $\epsilon_{ijk\ell}$ is the error term. The variance components are $\sigma_\alpha^2$, $\sigma_\beta^2$, $\sigma_\gamma^2$, $\sigma_{\beta\gamma}^2$, and $\sigma_\epsilon^2$, respectively.

To facilitate the understanding of the application of the exact testing procedure to this example, the reader is referred to Table 2, which lists the values of some key quantities used in the development of the exact tests. The expected mean square values of $MS_i = SS_i / m_i$ ($i = 1, 2, 3, 4$) and $MS_E^{(2)} = SS_E^{(2)} / \ell_2$ are given in Table 3.

From Tables 2 and 3 it can be seen that the value of the exact F-statistic for testing the hypothesis $H_0: \sigma_{\beta\gamma}^2 = 0$ versus $H_a: \sigma_{\beta\gamma}^2 > 0$ is $F = MS_4 / MS_E^{(2)} = 7.090$ with 18 and 47 degrees of freedom. The significance level is $3.4 \times 10^{-8}$. The second hypothesis to be tested is $H_0: \sigma_\beta^2 = 0$ versus $H_a: \sigma_\beta^2 > 0$. The corresponding value of the F-statistic is $F = MS_2 / MS_4 = .994$ with 9 and 18 degrees of freedom. This is a nonsignificant test. Next, the value of the F-statistic for the hypothesis $H_0: \sigma_\gamma^2 = 0$ is $F = MS_3 / MS_4 = 3.293$ with 6 and 18 degrees of freedom. The level of significance in this case is .023.

Finally, the testing of the hypothesis $H_0: \sigma_\alpha^2 = 0$ versus $H_a: \sigma_\alpha^2 > 0$ requires the use of Satterthwaite's procedure since no mean square exists in Table 3 with an expected value equal to that of $MS_1$ under $H_0$. The test statistic in this case is given by

$$F = \frac{MS_1}{MS_2 + MS_3 - MS_4} = 5.184,$$

which is approximately distributed as an F random variable with 2 and $\eta$ degrees of freedom, where

$$\eta = \frac{(MS_2 + MS_3 - MS_4)^2}{(MS_2)^2/9 + (MS_3)^2/6 + (MS_4)^2/18} = 5.477.$$

The level of significance is .055.
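The F ratios and the Satterthwaite degrees of freedom quoted above follow directly from the sums of squares and ranks listed in Table 2. A short arithmetic sketch in plain Python (the variable names are illustrative; significance levels are not recomputed here):

```python
# Sums of squares SS_i = w_i' w_i and their degrees of freedom m_i (Table 2).
ss = {1: 1265.96, 2: 332.313, 3: 733.949, 4: 668.634}
m = {1: 2, 2: 9, 3: 6, 4: 18}

ms = {i: ss[i] / m[i] for i in ss}          # mean squares MS_i = SS_i / m_i

f_beta = ms[2] / ms[4]                      # tests sigma_beta^2 = 0, df (9, 18)
f_gamma = ms[3] / ms[4]                     # tests sigma_gamma^2 = 0, df (6, 18)

# Satterthwaite's procedure for sigma_alpha^2 = 0.
denom = ms[2] + ms[3] - ms[4]
f_alpha = ms[1] / denom
eta = denom ** 2 / (ms[2] ** 2 / m[2] + ms[3] ** 2 / m[3] + ms[4] ** 2 / m[4])

print(round(f_beta, 3), round(f_gamma, 3), round(f_alpha, 3), round(eta, 3))
```

The printed values can be compared with .994, 3.293, 5.184, and 5.477 as reported in the text.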


Appendix  A 

This  appendix  gives  the  proof  of  Lemma  3.4. 


Proof.

(i) From (3.2), $P_0 = A_0 / b_0$ and $b_0 = c$ by (3.3) since $\psi_0$ is the empty set, hence $\psi_0^c = \tau$. But, by (2.14), $A_0 = J_c$. It follows that $P_0 = J_c / c$. Consequently, $Q_0 = 1_c' / \sqrt{c}$.

(ii) $Q_i Q_i' = I_{m_i}$. This follows by the definition of $Q_i$. Furthermore, $Q_i = V_i P_i$ for some matrix $V_i$ of order $m_i \times c$ ($i = 0, 1, \ldots, \nu$). Since $P_i P_{i'} = 0$ for $i \neq i'$, then

$$Q_i Q_{i'}' = V_i P_i P_{i'} V_{i'}' = 0.$$

(iii) From (3.4) and (3.5) it can be seen that $A_j Q_i' = A_j P_i V_i' = 0$, if $\psi_i \not\subset \psi_j$, and $A_j Q_i' = A_j P_i V_i' = b_j P_i V_i' = b_j Q_i'$, if $\psi_i \subset \psi_j$.


Appendix  B 


This  appendix  gives  the  proof  of  Lemma  3.5. 


Proof.

(i) This is straightforward.

(ii) By (3.15) and (3.18),

$$DR = \left[ \bigoplus_{\tau \in T} (1_{n_\tau}' / n_\tau) \right] \left[ I_N - \bigoplus_{\tau \in T} (J_{n_\tau} / n_\tau) \right] = \bigoplus_{\tau \in T} (1_{n_\tau}' / n_\tau) - \bigoplus_{\tau \in T} (1_{n_\tau}' / n_\tau) = 0.$$

(iii) The matrix $X_i$ can be partitioned into $c$ submatrices that correspond to the values of $\tau$ in (2.3). The submatrix corresponding to a particular $\tau$ is of order $n_\tau \times c_i$, where $c_i$ is the number of columns of $X_i$ ($i = 1, 2, \ldots, \nu$). Let us denote such a submatrix by $U_\tau$. Each column of $U_\tau$ consists of either $n_\tau$ zeros or $n_\tau$ ones. Therefore, $R X_i$ can be partitioned into $c$ submatrices of orders $n_\tau \times c_i$ for the different values of $\tau$. For a particular $\tau$, the corresponding submatrix is of the form

$$(I_{n_\tau} - J_{n_\tau} / n_\tau) U_\tau = U_\tau - U_\tau = 0,$$

by the property of $U_\tau$ described earlier. It follows that $R X_i = 0$ for $i = 1, 2, \ldots, \nu$.


Table 1

Data for the numerical example


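Table 2

Some key quantities used in the development of the exact tests for the numerical example

Quantity              Formula cited   Corresponding value
--------------------  --------------  ------------------------------------------------------------------
$\nu$                 (2.1)           4
$\tau$                (2.3)           $(i, j, k)$
$c$                   (2.5)           36
$N$                   (2.6)           118
$b_0$                 (3.3)           36
$b_1$                 (3.3)           12
$b_2$                 (3.3)           3
$b_3$                 (3.3)           4
$b_4$                 (3.3)           1
$P_1$                 (3.2)           $(I_3 \otimes J_{12})/12 - J_{36}/36$
$P_2$                 (3.2)           $(I_{12} \otimes J_3)/3 - (I_3 \otimes J_{12})/12$
$P_3$                 (3.2)           $(I_3 \otimes J_4 \otimes I_3)/4 - (I_3 \otimes J_{12})/12$
$P_4$                 (3.2)           $I_{36} - (I_{12} \otimes J_3)/3 - (I_3 \otimes J_4 \otimes I_3)/4 + (I_3 \otimes J_{12})/12$
$m_1$                 (3.29)          2
$m_2$                 (3.29)          9
$m_3$                 (3.29)          6
$m_4$                 (3.29)          18
$\delta_1$            (3.9)           $12\sigma_\alpha^2 + 3\sigma_\beta^2 + 4\sigma_\gamma^2 + \sigma_{\beta\gamma}^2$
$\delta_2$            (3.9)           $3\sigma_\beta^2 + \sigma_{\beta\gamma}^2$
$\delta_3$            (3.9)           $4\sigma_\gamma^2 + \sigma_{\beta\gamma}^2$
$\delta_4$            (3.9)           $\sigma_{\beta\gamma}^2$
$\ell_1$              (3.24)          35
$\ell_2$              (3.24)          47
$\lambda_{\max}$      (3.27)          1
$SS_1 = w_1' w_1$     Lemma 3.6       1265.96
$SS_2 = w_2' w_2$     Lemma 3.6       332.313
$SS_3 = w_3' w_3$     Lemma 3.6       733.949
$SS_4 = w_4' w_4$     Lemma 3.6       668.634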

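Table 3

Expected mean square values for the numerical example

Mean square                       Expected value
--------------------------------  ----------------------------------------------------------------------------------------
$MS_1 = SS_1/2$                   $12\sigma_\alpha^2 + 3\sigma_\beta^2 + 4\sigma_\gamma^2 + \sigma_{\beta\gamma}^2 + \sigma_\epsilon^2$
$MS_2 = SS_2/9$                   $3\sigma_\beta^2 + \sigma_{\beta\gamma}^2 + \sigma_\epsilon^2$
$MS_3 = SS_3/6$                   $4\sigma_\gamma^2 + \sigma_{\beta\gamma}^2 + \sigma_\epsilon^2$
$MS_4 = SS_4/18$                  $\sigma_{\beta\gamma}^2 + \sigma_\epsilon^2$
$MS_E^{(2)} = SS_E^{(2)}/47$      $\sigma_\epsilon^2$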